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(54) System for composing multimedia signals for interactive television services. 

(57) A system for composing multimedia signals 
from various source signals, such as still-frame 
video signals, full motion video signals and 
audio signals includes a plurality of composing 
units (314), each of which is capable of compos- 
ing (in 440) a multimedia signal from selected 
source signals. One of such composing units 
(314) is assigned for each multimedia signal to 
be composed. Each composing unit (314) ac- 
cepts parameters specifying various properties 
for each element of the multimedia signal, such 
as size, position and precedence for each video 
element and gain for each audio element. The 
composing units can include decompressors 
(418,420,422) for decompressing source signals 
furnished in compressed form and a compres- 
sor (444,446) to compress the multimedia signal 
to be transmitted to a user. The source signals 
can be received from a packet network and the 
resulting multimedia signals transmitted to the 
respective users via the same packet network. 
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Technical Field 

This invention relates to interactive television 
systems, and more particularly to systems for gener- 
ating multimedia signals from various video and au- 
dio sources for transmission to users in connection 
with furnishing interactive television services. 

Background of the Invention 

Cable television systems have made possible the 
transmission of many channels of television pro- 
grams to the homes of subscribers. Instead of being 
limited to the number of VHF and UHF channels that 
can be transmitted and received in a given area, the 
number of channels in cable systems is limited only 
by the transmission characteristics of the cable itself 
and the ability to compress the information in televi- 
sion programs into narrower- bandwidth channels. 
The advent of fiber-optic transmission systems for 
use in cable television networks has also vastly in- 
creased the number of channels available. 

Such increase in channels has also given rise to 
proposals for interactive television systems wherein a 
subscriber can transmit information or requests back 
into the system, which information or requests may 
subsequently affect programs or information directed 
to such subscriber. There are a wide variety of appli- 
cations for interactive television systems, such as vid- 
eo games, video catalog shopping, teaching systems, 
movies on demand and audio programs. Each appli- 
cation can be tailored for an individual subscriber, for 
example a subscriber may be able to select the lan- 
guage of the soundtrack in a movie. However, such 
systems typically require the ability (i) to control spe- 
cific programs or information sent to each subscriber 
and (ii) to receive input messages or requests from 
the subscriber. 

A headend-to-subscriber channel does not need 
the full bandwidth of the usual television cable chan- 
nel if compression can be used. For example if a tel- 
evision signal is digitized and compressed in accor- 
dance with the MPEG standard, it is possible to trans- 
mit sixteen television programs digitally over a single 
conventional 6 Mhz cable channel. Other program in- 
formation, such as high-fidelity audio, still video pic- 
tures or text can also be sent in compressed form. By 
using a large number of conventional channels for ITV 
purposes, such as is now possible with optical fiber 
distribution systems, together with compression, it is 
conceivable that hundreds of virtual channels could 
be made available. 

Systems are also becoming available for trans- 
mitting compressed and encoded television signals 
over telephone circuits. In one such system, 1 .5 meg- 
abits/second of data, sufficient for one compressed 
and encoded television channel, can be transmitted 
from a telephone central office over a twisted-pair 



loop to a subscriber's premises, and 16 kilobits of 
data can be transmitted back to the central office, to- 
gether with regular telephone service. Such a loop is 
known as an asymmetric digital subscriber loop. 

5 With the wide deployment of digital communica- 

tion systems, standardized digital transmission sys- 
tems have become available in which data is trans- 
mitted in packet form over networks. One such net- 
work is an asynchronous transmission mode (ATM) 

10 network in which packets of varying length can be 
sent. Such packet networks can be used for carrying 
digitized television signals. 

Clearly the means now exist by which tailored tel- 
evision signals can be delivered to individual users, 

15 such as by cable television systems, telephone net- 
works, packet networks and satellites. Also, a large 
number of sources are available from which pro- 
grams and interactive television applications can be 
provided. However, problems arise in the efficient 

20 generation of television signals composed of signals 
from multiple video and/or audio sources for use in 
providing interactive television services. 

Summary of the Invention 

25 

The system of the invention composes multime- 
dia signals from various source signals, such as still- 
frame video signals, full motion video signals and au- 
dio signals. The system includes a plurality of com- 

30 posing units each of which is capable of composing 
a multimedia signal from selected source signals. 
One of such composing units is assigned for each 
multimedia signal to be composed. Each composing 
unit accepts parameters specifying various proper- 

35 ties for each element of the multimedia signal, such 
as size, position and precedence for each video ele- 
ment and gain for each audio element. 

Each composing unit can include decompressors 
for decompressing source signals furnished in com- 

40 pressed form and a compressor to compress the mul- 
timedia signal to be transmitted to the user. 

The source signals can be received from a packet 
network and the resulting multimedia signals trans- 
mitted to the respective users via the same packet 

45 network. 

These and other aspects of the invention will be- 
come apparent from the drawings and the detailed 
description. 

so Brief Description of the Drawing 

FIG. 1 is an overall block diagram of an integrated 
television services system using the multimedia 
frame of the invention. 
55 FIG. 2 is a block diagram showing the overall flow 

of control and data packets in the system of FIG. 1. 

FIG. 3 is an overall blockdiagram of the multime- 
dia frame of the invention. 
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FIG. 4 is a block diagram of a composing unit 
used in the multimedia frame of the invention. 

FIG. 5 is a diagram of a multimedia menu screen 
that can be composed with the system of the inven- 
tion. 

Detailed Description 

FIG. 1 is an overall block diagram of a basic inte- 
grated television services system in which the multi- 
media frame of the invention can be used. The ele- 
ments of the system communicate through packet 
network 10, which includes one or more packet 
switches 120. Such elements include control proces- 
sor 100, application processor 102, application re- 
sources such as data cache 104, multimedia frame 
106, broadcast source 108 and game frame 110 and 
one or more distribution interfaces 112 and 114, all of 
which are connected by links 140 to packet switch 
120. Program library 105 is a high- capacity, long- 
term source of stored program material for data cache 
104. Broadcast source 108 furnishes digitized and 
compressed program material from television broad- 
casts, cable systems, satellites or other means. In an 
exemplary embodiment, packet network 10 is an 
asynchronous transfer mode (ATM) network, links 
140 are SONET OC-3 links, and packet switch 120 is 
an ATM switch such as the AT&T GCNS- 2000 ATM 
switch. 

In general, control processor 101 serves to inter- 
act with users selecting any of the various interactive 
services that may be provided by the system of the in- 
vention, and application processor 102 controls the 
services themselves. Control processor 101 also as- 
signs the resources necessary to provide a service to 
a user. Application processor 102 and application re- 
sources such as data cache 104, multimedia frame 
106, broadcast source 108 and game frame 110 can 
be duplicated as necessary to meet the capacity re- 
quirements of the system, but control processor 101 
is preferably not duplicated because such processor 
maintains assignment records for the various resourc- 
es in the system and making such records available 
to more than one processor, while achievable, is cum- 
bersome. However, these task assignments to con- 
trol processor 101 and application processor 102 are 
not inviolate; for example, in some cases it may be de- 
sirable to combine the functions of both processors in 
a single processor, or to use control processor 101 for 
certain applications. 

Data cache 1 04 is a large random access mem- 
ory for storing audio and video material in com- 
pressed and packetized form, such as the program 
server described in our copending patent application 
Serial No. 07/997,985 filed December 29, 1992 and 
having the same assignee as this invention. Data 
cache 104 can have one or more links 140 into packet 
network 10 depending on capacity requirements. 



Program library 105 can be a large-capacity system 
of any convenient type for storing program material, 
such as movies, musical selections, video clips, still 
frames and audio clips, in digitized and compressed 

5 form on devices such as magnetic tapes or optical 
discs and the like. Link 142 between program library 
105 and data cache 104 can also be made via a sep- 
arate ATM packet network, such as a SONET OC-3 
network, or via packet network 10, depending on ca- 

10 pacity and geographic limitations. Program library 
105 is typically equipped to transmit a large quantity 
of data, such as digitized and compressed video and 
audio signals for complete movies, to data cache 1 04 
in a relatively short time, as described in more detail 

15 in our above mentioned application. 

It is contemplated that the various elements 
shown in FIG. 1 can be situated at different geograph- 
ic locations. For example, packet switch 120, control 
processor 100, application processor 102, multime- 

20 dia frame 106, broadcast source 108 and game frame 
110 can be at a first location, data cache 104 can be 
at a second location and program library 105 can be 
at a third location. Other packet switches 120 and dis- 
tribution interfaces such as 112 and 114 can be at still 

25 other locations. Connections between locations can 
be SONET OC-3 links. It will be clear to those skilled 
in the art that numerous possible combinations of and 
locations for such elements are possible without de- 
parting from the spirit and scope of the invention. 

30 Many interactive television services may be of- 

fered that incorporate broadcast television programs. 
Therefore, one of the application resources provided 
in the system of the invention can be a source of such 
programs, such as broadcast source 108, which re- 

35 ceives one or more broadcast television signals from 
regular broadcasts, satellite transmissions, cable tel- 
evision systems or any other appropriate means, all 
symbolized by antenna 109. Broadcast TV source 
108 includes facilities for digitizing and compressing 

40 each received television signal and transmitting such 
signals in packetized form via packet network 10. 

Multimedia frame 106 contains a number of units 
for composing compressed and packetized multime- 
dia signals from a number of different sources, such 

45 as data cache 1 04, broadcast source 1 08 and internal 
text generators. Each such source typically provides 
a signal in digitized and compressed form. When a 
multimedia signal composed from multiple video 
and/or audio sources is to be transmitted to a user, 

so control processor 102 assigns one of such composing 
units and the necessary virtual channels to create the 
multimedia signal. Note that the input signals to mul- 
timedia frame 1 06, as well as output signals to users, 
are all transmitted in packetized form through packet 

55 network 10. Multimedia frame 106 will be described 
in more detail below. 

Game frame 110 contains a number of units for 
generating compressed and packetized audio and 



3 



5 



EP 0 624 040 A2 



6 



video signals for use in computer games that may be 
offered as part of an interactive television service. 
One such unit is assigned to each user of a computer 
game. 

Application processor 102 and applications re- 
sources 104, 106, 108 and 110 can be combined in 
various ways to form what can be thought of as appli- 
cation servers for providing different ITV services. 
For example, a video-on-demand / enhanced-pay- 
per-view server could include application processor 
102 and data cache 104; a multimedia ITV server 
could include application processor 102, data cache 
1 04, multimedia frame 1 06 and broadcast source 1 08 
and a game server could include application proces- 
sor 102, multimedia frame 106 and game frame 110. 
Each application processor 102 can be part of more 
than one such server, up to the limit of its capacity. 

Interfaces such as cable distribution interface 
112 are typically located at cable television system 
headends and connected to signal converters 1 30 sit- 
uated on cable customers' premises by distribution 
cable 116, whereby the signal converter 1 30 for each 
cable customer receives the same overall "down- 
stream" signals broadcast to all cable customers 
served by the same cable distribution interface 112. 
However, signal converters 130 can be enabled by 
control packets transmitted over cable 116 to receive 
only certain specified "virtual" channels from cable 
116. 

Interfaces such as telephone distribution inter- 
face 114 are typically located at telephone central of- 
fices and connected to signal converters 1 50 situated 
on telephone subscribers' premises by subscriber 
loops 118, whereby each telephone subscriber re- 
ceives a different downstream signal. 

Signal converters 130 and 150 convert the down- 
stream signals into conventional television signals for 
display on television receivers 134. Users can send 
"upstream" signals to control processor 100 and ap- 
plications processors 102 from input means such as 
remote control 1 32 that transmit control messages to 
signal converters 130 or 150, as the case may be. 
Other devices, such as processors, printers, video 
cameras, telephones and other input and output de- 
vices can be connected to signal converters 130 and 
1 50 as required for the provision of ITV services. 

The various data and control packets transmitted 
through the packet network each contain a header 
and a body. The header defines a "virtual channel" in 
which the packet is deemed to be transmitted. In data 
packets, the body contains data, such as part of a 
digitized and compressed audio or video signal. In 
control packets, the body contains information such 
as the address of the unit for which the packet is in- 
tended and an action to be performed. In the exem- 
plary embodiment, each packet contains five bytes in 
the header and 48 bytes in the body. The header can 
include information such as polling addresses and vir- 



tual channel identities. Control packets are typically 
sent in one or more specifically assigned virtual chan- 
nels, for example, channel "0." 

For each link 140 connected to packet switch 120, 

5 packet switch control 121 stores information specify- 
ing how to route packets arriving over such link. Such 
information is based on header contents; for exam- 
ple, certain virtual channels and/or certain polling ad- 
dresses can be identified for transmission over a link. 

10 A packet arriving at packet switch 1 20 over any link is 
then routed to one or more links 140 depending on its 
header contents. Packet switch control 121 can itself 
receive packets from packet network 10, and, in the 
system of the invention, control processor 1 00 sends 

15 control packets as needed to packet switch control 
121 containing such identifying information for each 
link 140. 

In the exemplary embodiment, a television signal 
is made up of three separate data signals, one video 

20 and two audio (for stereophonic sound). Each signal 
is separately digitized and compressed and sent 
through packet network 120 as a packet stream in a 
separate virtual channel. Accordingly, three virtual 
channels are used for transmission of a single televi- 

25 sion signal. However, other embodiments are possi- 
ble in which multiple audio channels are encoded and 
compressed together, such as the Dolby (r) AC-2 sys- 
tem. 

FIG. 2 shows typical flows of data and control 

30 packets through packet network 1 0 and in other parts 
of the system. During the furnishing of ITV services, 
data packets containing audio and/or video informa- 
tion flow from application resources such as data 
cache 104, multimedia frame 106 and game frame 

35 110 to signal converters 130 and 150. Such data 
packets can also flow between application resourc- 
es, such as from data cache 1 04 to multimedia frame 
106 and game frame 110 and from broadcast source 
1 08 to multimedia frame 1 06. During interactions with 

40 users, control packets flow between signal conver- 
ters 130 and 150 and control processor 100 and/or 
between signal converters 130 and 150 and applica- 
tion processor 102. Control processor 100 interacts 
with application processor 1 02 and with packet switch 

45 control 121 by means of control packets, and appli- 
cation processor 102 interacts with application re- 
sources 104, 105, 106, 108 and 110 by means of con- 
trol packets. Data packets from program library 105 
to data cache 104 can be sent via a direct link or via 

so a packet network; even via packet network 10, if con- 
venient. The link between application processor 102 
and program library 105 carries only control packets 
and can be of lower capacity than the links 140, for 
example, such link can be part of an Ethernet (r) net- 

55 work. However, such I ink can also be a low-bandwidth 
virtual channel in packet network 10, if desired. 

The flexibility of packet network 10 makes pos- 
sible flows of data and control packets between any 
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pair of terminals and from one terminal to groups of 
terminals. Possibly useful paths for control packets 
that are not shown are between signal converters 1 30 
and 1 50 on the one hand to multimedia frame 1 06 and 
game frame 1 1 0 on the other. These paths may be de- 
sirable for use in time-dependent interactive applica- 
tions using multimedia frame 106 and game frame 
110 in which an extremely fast response to an action 
by a user is desired. As will be described, multimedia 
frame 106 contains processors that can generate and 
respond to such control packets, and game frame 1 1 0 
typically will also contain such processors. 

In general, control processor 100 interacts with 
users to control requested services and to assign the 
various resources needed to furnish the services, in- 
cluding the necessary virtual channels in packet net- 
work 1 0 and the various other I inks in the system. Dur- 
ing the delivery of a service by application resources 
101. 106 and 110, under control of application proc- 
essor 102, control processor 100 maintains supervi- 
sion by responding to certain control packets from 
signal converters 130 and 150; application processor 
102 responds to control packets from signal conver- 
ters 130 and 150 specifically relating to an ITV ser- 
vice being delivered. However, as described above, 
application resources may be equipped to respond di- 
rectly to control packets from signal converters 130 
and 150. 

FIG. 3 is an overall block diagram of multimedia 
frame 106, which includes a control and demultiplex- 
er (DEMUX) unit 310, a number of composing units 
314 and a multiplexer unit 316. Control and demulti- 
plexer 310 routes packets from packet network 1 0 for 
the various components to be included in multimedia 
signals via links 312 to the various composing units 
314. Multiplexer 316 accepts packets for the various 
multimedia signals via links 318 from composing 
units 314 and transmits such packets over packet 
network 1 0. 

FIG. 4 is an overall block diagram of composing 
unit 314. Data dispatcher 410 receives packets from 
control and demultiplexer 310 via link 312 and routes 
such packets via bus412 to various elements such as 
text generator 414, cursor generator 416, still-frame 
video decompressors 418, full-motion video decom- 
pressors 420, audio decompressors 422 and control 
424. In general, text generator 414, cursor generator 
416 and control 424 receive control packets and still- 
frame video decompressors 418, full-motion video 
decompressors 420 and audio decompressors 422 
receive data packets. 

The output of each video decompressor 41 8 and 
420 is connected to the input of a corresponding video 
scaler and positioner 430 or 432, the outputs of which 
are connected to video composer 440. The output of 
each audio decompressor 422 is connected to the in- 
put of a corresponding audio scaler 434, the outputs 
of which are connected to audio mixer 442. Video 



composer 440 composes full-motion video frames in 
digital form from the various outputs from scalers and 
positioners 430 and 432 and video compressor 444 
compresses such frames into data packets for trans- 

5 mission via multiplexer 450, link 318 and multiplexer 
316 (FIG. 3) to packet network 10. Audio mixer 442 
combines audio signals from audio scalers 434 and 
audio compressor 446 compresses such frames into 
data packets for transmission, again via multiplexer 

10 450, link 318 and multiplexer 316 to packet network 
1 0. Multiplexers 450 and 316 can be combined, if con- 
venient. 

Text generator 414 generates text for display on 
a user's television screen in response to control pack- 

15 ets received from data dispatcher 4 10. Text generator 
414 can include facilities for producing different type 
styles and other displays such as icons and graphic 
symbols, and can be arranged to place text and other 
display material at any location on the screen, as is 

20 well known in the art. Control packets for text gener- 
ator 41 4 typically originate in control processor 100 or 
application processor 102; however, for complex text 
messages that are used often, it may be desirable to 
store such packets in data cache 104. 

25 Cursor generator 416 generates a cursor for dis- 

play on a user's television screen when required by a 
particular ITV service. The cursor symbol to be used 
and its position on the screen are specified by control 
packets received via data dispatcher 410. The posi- 

30 tion of a cursor is typically determined by interaction 
with a user operating an input device such as a 
mouse, a trackball or the like connected to signal con- 
verter 130 or 150. The cursor position can be associ- 
ated with locations of letters or symbols in text, or with 

35 the position of other elements on the screen, as de- 
sired. As mentioned above, control packets from a 
user during an application are typically routed to ap- 
plication processor 102. However, because of the 
highly time- dependent nature of cursor movement, 

40 this is an example of a situation in which it may be de- 
sirable for control packets from a user to bypass ap- 
plication processor 102 and be routed directly from 
packet network 10 to multimedia frame 106. Such 
packets then proceed through control and demulti- 

45 plexer 310 to data dispatcher 410 and cursor gener- 
ator 416 in the composing unit 314 assigned to the 
user. 

An alternative location for text generator 414 
and/or cursor generator 41 6 is in signal converter 130 

so or 1 50 at the user's location. In such a configuration, 
control packets specifying text can be transmitted to 
the signal converter and control packets specifying 
cursor position can be transmitted by the signal con- 
verter. The choice is a tradeoff based on the cost of 

55 providing such units for each user in comparison to 
the cost of fewer, shared units in composing unit 314 
and on differences in control- packet transmissions 
required. 
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Still-frame video decompressors 418 receive 
data packets containing compressed still-frame vid- 
eo information in a format such as the well-known 
JPEG-standard format. Each decompressor 41 8 typ- 
ically includes decompression logic and a frame buf- 
fer capable of storing the resulting uncompressed 
data for all pixels in a full video frame, for example, 
the C-Cubed CL550 circuit. If desired, such decom- 
pressors can also be provided with smallerf rame buf- 
fers for partial frames. The origin of such data packets 
will typically be data cache 104. 

Similarly, full-motion video decompressors 420 
receive data packets containing compressed full- 
motion video information in a format such as the well- 
known MPEG-standard format. Each decompressor 
420 typically includes decompression logic and one 
or more frame buffers each capable of storing the un- 
compressed data for all pixels in a full video frame, for 
example the C-Cubed CL450 circuit. Again, if desired, 
such decompressors can also be provided with small- 
er frame buffers for partial frames. The origin of such 
data packets will typically be data cache 104 or 
broadcast source 108. 

Audio decompressors 434 receive data packets 
containing compressed audio information in a format 
such as the Dolby AC-2 format. The output of audio 
decompressor 422 is typically one or more audio sig- 
nals in digital form. If the particular audio compression 
format used does not provide for multiple channels, 
then a decompressor 422 can be used for each audio 
channel desired. Again, the origin of such data pack- 
ets will typically be data cache 104 or broadcast 
source 108. 

Video scalers and positioners 430 and 432 each 
have the ability to accept full video frames in digitized 
form from decompressors 418 and 420, reduce the 
scale of such frames and reposition reduced frames 
at specified locations in a full-size frame. The amount 
of the scale reduction and the position are specified 
by control packets received by control 424, which for- 
wards the necessary control information to scalers 
and positioners 430 and 432 over connections not 
shown. Thus, the output of each scaler and positioner 
430 and 432 is a digitized full video frame or se- 
quence of frames, as the case may be, with the 
scaled video information at the specified position and 
the remainder of the frame or frames being black. A 
specific device that can be used is the Raytheon TMC 
2302 circuit. 

Audio scalers 434 control the gain of the audio 
signals from decompressors 422 by means of signals 
from control 424 in response to control packets. Scal- 
ers 434 are provided to correspond with the number 
of audio channels from decompressors 422, and can 
be digital or analog, depending on the output from de- 
compressors 422. If multiple audio channels are pro- 
vided, the apparent position of an audio source can be 
shifted by appropriate changes in the gain for each 



channel. 

Those skilled in the art will realize that there are 
many ways in which elements shown in FIG. 4 can be 
combined. For example, a video decompressor 41 8 or 

5 420 and its corresponding scaler and positioner 430 
or 432 could be combined. Similarly, an audio decom- 
pressor 422 might be combined with its correspond- 
ing scaler 434. Also, because of the duplication of 
similar units, the opportunity exists to share various 

10 functions among like elements, to the extent that ca- 
pacity and timing requirements permit. 

Video composer 440 combines the outputs from 
video scalers and positioners 430 and 432 (and, if 
provided, text generator 414 and cursor generator 

15 416) into a single video frame. One device that can be 
used for such purpose is described in application Ser- 
ial No. 08/029205 filed on March 10, 1993, in which 
each input is considered, on a pixel by pixel basis, as 
a layer with a given precedence and layers with high- 

20 er precedence mask those with lower precedence. 
Video composer 440 can also include facilities to pro- 
duce special effects such as zooms, fades, wipes and 
other effects well known in the art. 

Audio mixer 442 combines the outputs from au- 

25 dio scalers 434 to form the desired number of audio 
signals, for example, two audio signals if two-channel 
stereophonic audio is desired. Again, mixer 442 can 
be analog or digital, depending on the outputs from 
scalers 434. 

30 Video compressor 444 and audio compressor 

446 receive the output signals from video composer 
440 and audio mixer 442, respectively, and compress 
such signals into the desired format for transmission 
as data packets over packet network 10. Compres- 

35 sors 444 and 446 can insert the appropriate header 
information into such packets based on information 
received from control 424 over connections not 
shown. The compression formats used by compres- 
sors 444 and 446 do not necessarily have to be the 

40 same as those used on the input side; in fact, com- 
pressors 444 and 446 can be configured to have se- 
lectable formats so that a format desired or needed 
by the user can be used. Again, such format selection 
can be governed from control 424 in response to re- 

45 ceived control packets. 

Control 424 is typically a microprocessor, such as 
the AMD 29030 microprocessor, with the necessary 
stored program and random-access memory needed 
to respond to control packets and to send the neces- 

50 sary control signals to the various elements in com- 
posing unit 314. Control 424 can also be arranged to 
transmit control packets over packet network 10, 
such as by a connection (not shown) to multiplexer 
450. 

55 In an exemplary embodiment, composing unit 

412 includes 2 still-frame video decompressors 418, 
one of which is designated by reference numeral 418, 
full-motion video decompressors 420, 5 two-channel 
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audio decompressors 422 and a corresponding num- 
ber of video scalers and positions 430 and 432 and 
audio scalers 434. Video composer 440 has the ca- 
pacity to handle layers, not including the inputs from 
text generator 414 and cursor generator 416. 5 

Composing unit 314 has been described assum- 
ing that the input data packets contain compressed 
video and audio information. From a practical stand- 
point, compression is likely to be used because of the 
significant bandwidth and memory reductions possi- 10 
ble. However, it should be clear to those skilled in the 
art that the principles of the invention can be applied 
to systems in which such compression, and corre- 
sponding decompression, are not used, or are used 
only in portions of the system. 15 

Composing units similar to 314 but including dif- 
ferent complements of resources can be used where 
appropriate. For example, for applications in which 
only an additional audio signal need be superi mposed 
on an existing audio signal, or text on a video signal, 20 
composing units can be configured especially for 
such purposes. Composing unit 314 can be thought 
of as a universal unit capable of meeting many differ- 
ent needs. 

Use of the invention in composing a multimedia 25 
menu screen for an interactive television system will 
now be described. Such a menu screen can be dis- 
played to a user first gaining access to the system on 
the user's television set, and is an example of a mul- 
timedia program composed by a composing unit 312. 30 
Referring to FIG. 5, menu screen 500 consists of 
three scaled-down full-motion video images 520, 530 
and 540 displayed in window overlays on a still-frame 
background 510 and with superimposed text 550. Dis- 
play of such a screen is typically accompanied by ap- 35 
propriate background audio, such as a stereophonic 
musical selection, and may also include an introduc- 
tory voice message, such as a greeting with brief in- 
structions on how to proceed with selection of a ser- 
vice. 40 

Menu screen 500, can be transmitted to signal 
converters 130 and 150 for users not currently using 
ITV services (inactive users). Control processor 100 
receives control packets from such signal converters. 
A generic menu screen is somewhat analogous to dial 45 
tone in a telephone system, and can be "broadcast" 
to multiple users by transmitting the audio and video 
portions of the screen over dedicated virtual channels 
in packet network 10 and enabling inactive users to 
receive such dedicated virtual channels when they 50 
turn on their television receivers and select the ITV 
channel. However, when the menu screen is accom- 
panied by an introductory voice message that is cued 
to begin when the user selects the ITV channel, for 
example, by means of a control packet sent by signal 55 
converter 130 or 150, such voice message can be su- 
perimposed on the generic menu screen by means of 
another composing unit configured for that purpose. 



To generate menu screen 500, control processor 
100 assigns a composing unit 314 in multimedia 
frame 106 and virtual channels in packet network 10 
for routing the appropriate audio and video signals for 
the elements in the menu screen, in digitized and 
compressed form, to the assigned composing unit 
314. The full-motion video packets for the movie ex- 
cerpt to be shown in window 520, the game teaser to 
be shown in window 540 and the audio packets for the 
background music and introductory voice message 
are stored in data cache 104. The video signal for the 
shopping channel to be shown in window 530 is sup- 
plied by broadcast source 1 08. The video packets for 
background 510 are transmitted initially to one of still- 
frame video decompressors 418 and stored in a 
frame buffer in such decompressor. The text is gen- 
erated by text generator 414 in composing unit 314. 
Virtual channels are also assigned for the output of 
composing unit 314. 

Control processor 100 also assigns the elements 
inside the assigned composing unit 314 needed for 
the menu screen and initializes the various scalers 
and positioners in such composing unit by transmit- 
ting control packets addressed to such composing 
unit. Control 424 in the composing unit then transmits 
the appropriate control signals to the assigned ele- 
ments. For example, for menu screen 500, one still- 
frame video decompressor 418 and its associated 
scaler and positioner are needed for background 500, 
with the scaler and positioner set to fi II the screen with 
the output frame from such decompressor; three full- 
motion video decompressors and their associated 
scalers and positioners are needed for windows 720, 
730 and 740, with the scalers and positioners set to 
reduce the size of and position the windows as shown 
in FIG. 5; text generator 41 4 is needed for the text and 
two audio decompressors 422 and their associated 
scalers are needed for the introductory voice mes- 
sage superimposed on the musical background. The 
input to video composer 440 for background 510 is 
given the lowest precedence; the precedences for 
windows 520, 530 and 540 and for text 550 can be set 
arbitrarily as between themselves because there is 
no overlap of these elements. 

The invention has been shown and described 
with reference to particular embodiments. However, 
it will be understood by those skilled in the artthat va- 
rious changes may be made therein without departing 
from the spirit and scope of the invention. 



Claims 

1 . A system for composing multimedia signals from 
source signals, which comprises: 

a plurality of composing unit means, each 
said composing unit means being capable of 
composing a multimedia signal from selected 
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ones of said source signals; 

means for assigning one of said compos- 
ing unit means in response to a request for a mul- 
timedia signal; 

means for configuring said assigned com- 
posing unit means to compose said requested 
multimedia signal; 

means for applying said selected source 
signals to said assigned composing unit means in 
response to said request; and 

means for transmitting said multimedia 
signal to utilizing means. 

2. The system of claim 1 wherein said source sig- 
nals include video signals and each said compos- 
ing unit means comprises: 

at least one means for modifying a video 
signal in accordance with at least one video para- 
meter; 

said means for configuring comprises 
means for applying said video parameters to 
each said video modifying means; and 

said composing unit means further com- 
prises means for combining the outputs of said 
video modifying means into a composite video 
signal representing an overall image. 

3. The system of claim 2 in which at least one of said 
video parameters affects the size in said overall 
image of the image resulting from said source 
video signal. 

4. The system of claim 2 in which at I east one of said 
video parameters affects the position in said 
overall image of the image resulting from said 
source video signal. 

5. The system of claim 2 wherein at least one of said 
video signals is a still frame video signal. 

6. The system of claim 2 wherein at least one of said 
video signals is a full-motion video signal. 

7. The system of claim 1 wherein said source sig- 
nals include audio signals and each said compos- 
ing unit means comprises: 

at least one means for modifying an audio 
signal in accordance with at least one audio para- 
meter; 

said means for configuring comprises 
means for applying said audio parameter to each 
said audio modifying means; and 

said composing unit means further com- 
prises means for combining the outputs of said 
audio modifying means into a composite audio 
signal. 

8. The system of claim 7 in which at least one of said 



audio parameters affects the gain of said source 
audio signal. 

9. The system of claim 1 in which said at least one 
5 of said source signals is compressed and each 

composing unit means further comprises: 

at least one means for decompressing an 
applied source signal. 

10 10. The system of claim 1 in which said multimedia 
signal is compressed and each composing unit 
further comprises: 

means for compressing the multimedia 
signal composed by said composing unit. 
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